Mingjie Zhan

FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation

Feb 03, 2026
Via arXiv

SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics

Jan 14, 2026
Via arXiv

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

Sep 26, 2025
Via arXiv

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Sep 26, 2025
Via arXiv

Alignment with Fill-In-the-Middle for Enhancing Code Generation

Aug 27, 2025
Via arXiv

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning

May 29, 2025
Via arXiv

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

May 15, 2025
Via arXiv

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

May 06, 2025
Via arXiv

Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up

Mar 31, 2025
Via arXiv

SpiritSight Agent: Advanced GUI Agent with One Look

Mar 05, 2025
Via arXiv